Sequential vs. Hierarchical Syntactic Models of Human Incremental Sentence Processing
نویسندگان
چکیده
Experimental evidence demonstrates that syntactic structure influences human online sentence processing behavior. Despite this evidence, open questions remain: which type of syntactic structure best explains observed behavior–hierarchical or sequential, and lexicalized or unlexicalized? Recently, Frank and Bod (2011) find that unlexicalized sequential models predict reading times better than unlexicalized hierarchical models, relative to a baseline prediction model that takes wordlevel factors into account. They conclude that the human parser is insensitive to hierarchical syntactic structure. We investigate these claims and find a picture more complicated than the one they present. First, we show that incorporating additional lexical n-gram probabilities estimated from several different corpora into the baseline model of Frank and Bod (2011) eliminates all differences in accuracy between those unlexicalized sequential and hierarchical models. Second, we show that lexicalizing the hierarchical models used in Frank and Bod (2011) significantly improves prediction accuracy relative to the unlexicalized versions. Third, we show that using stateof-the-art lexicalized hierarchical models further improves prediction accuracy. Our results demonstrate that the claim of Frank and Bod (2011) that sequential models predict reading times better than hierarchical models is premature, and also that lexicalization matters for prediction accuracy.
منابع مشابه
Insensitivity of the human sentence-processing system to hierarchical structure.
Although it is generally accepted that hierarchical phrase structures are instrumental in describing human language, their role in cognitive processing is still debated. We investigated the role of hierarchical structure in sentence processing by implementing a range of probabilistic language models, some of which depended on hierarchical structure, and others of which relied on sequential stru...
متن کاملThe Integration of Syntax and Semantic Plausibility in a Wide-Coverage Model of Human Sentence Processing
Models of human sentence processing have paid much attention to three key characteristics of the sentence processor: Its robust and accurate processing of unseen input (wide coverage), its immediate, incremental interpretation of partial input and its sensitivity to structural frequencies in previous language experience. In this thesis, we propose a model of human sentence processing that accou...
متن کاملModeling the effects of memory on human online sentence processing with particle filters
Language comprehension in humans is significantly constrained by memory, yet rapid, highly incremental, and capable of utilizing a wide range of contextual information to resolve ambiguity and form expectations about future input. In contrast, most of the leading psycholinguistic models and fielded algorithms for natural language parsing are non-incremental, have run time superlinear in input l...
متن کاملCanonicity Effect on Sentence Processing of Persian-speaking Broca’s Patients
Introduction: Fundamental notions of mapping hypothesis and canonicity were scrutinized in Persian-speaking aphasics. Methods: To this end, the performance of four age-, education-, and gender matched Persian-speaking Broca's patients and eight matched healthy controls in diverse complex structures were compared via the conduction of two tasks of syntactic comprehension and grammaticality jud...
متن کاملTowards a Neuro-Cognitive Model of Human Sentence Processing
A formal sentence processing system is proposed which simulates different eventrelated potential (ERP) elicitation between sentences with and without unambiguous case marking. The electroencephalographical data are based on German subordinate clauses and Japanese sentences. As a formal framework we adopt Dynamic Syntax (Kempson et al. 2001), which enables incremental update of information by un...
متن کامل